fix: add clean-data playbook to prevent storage-related deployment failures #81

KatyaRyazantseva · 2026-01-06T23:45:25Z

While running devnet-1, one of the client nodes reached its storage limit. A subsequent Ansible deployment then failed while copying genesis files to the server because there was no space left, and restarting the devnet required manual cleaning. This PR adds centralized data cleaning that runs before genesis generation when the --cleanData flag is used.

The clean-data playbook can be called independently for cleaning up node data directories. It supports all deployment modes: site.yml, deploy-nodes.yml, and tag-based execution.

Scenarios tested on Lighthouse server

# Primary use case (via spin-node.sh)
NETWORK_DIR=local-devnet ./spin-node.sh --node lighthouse_0 --deploymentMode ansible --useRoot --cleanData

# Direct ansible with site.yml
./ansible-deploy.sh --node lighthouse_0 --clean-data

# With deploy-nodes.yml
./ansible-deploy.sh --playbook deploy-nodes.yml --node lighthouse_0 --clean-data

# With tags
./ansible-deploy.sh --node lighthouse_0 --network-dir ansible-devnet --tags lighthouse --clean-data

# Independent cleaning (no deployment)
ansible-playbook -i ansible/inventory/hosts.yml ansible/playbooks/clean-data.yml \
    -e "genesis_dir=$(pwd)/ansible-devnet/genesis" \
    -e "node_names=lighthouse_0"

ch4r10t33r

We appear to be double cleaning. When clean_data=true is passed via site.yml, data gets cleaned:

1. First in clean-data.yml (step 1)
2. Then again in each role's tasks

Should we consider using clean-data.yml for pre-deployment cleaning alone and remove role level cleaning when using site.yml?

ch4r10t33r · 2026-01-09T18:52:45Z

ansible/roles/lantern/tasks/main.yml

    msg: "Node key file {{ node_name }}.key not found in {{ genesis_dir }}"
  when: not (node_key_stat.stat.exists | default(false))

+- name: Check if node data directory has contents


Can we extract this into a shared task file under roles/common/tasks/clean-node-data.yml and use include_tasks in each role?

    - name: Clean node data if requested
      include_tasks: "{{ playbook_dir }}/../roles/common/tasks/clean-node-data.yml"
      when: clean_data | default(false) | bool

This will modularize the code instead of copying the same content across 5 different roles (a number that is likely to increase in future).

ch4r10t33r · 2026-01-09T18:54:11Z

ansible/roles/lantern/tasks/main.yml

    path: "{{ data_dir }}/{{ node_name }}"
    state: absent
-  when: clean_data | default(false) | bool
+  when:


The find task to check if a directory has contents before deletion is redundant. Ansible's file: state=absent is idempotent. It will succeed whether the directory exists, is empty, or has contents.
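A minimal sketch of the simplified task, assuming it keeps the path and condition from the diff above and simply drops the preceding find check. The comments and task name are illustrative, not the final wording in the PR.

```yaml
# Sketch only: path and condition are copied from the diff under review.
# No prior find/stat check is needed, because state=absent reports "ok"
# when the directory is already missing and "changed" when it removes it.
- name: Remove node data directory if cleaning was requested
  ansible.builtin.file:
    path: "{{ data_dir }}/{{ node_name }}"
    state: absent
  when: clean_data | default(false) | bool
```

With state: absent, a directory is removed recursively, so the task behaves the same whether the directory is empty, populated, or missing.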

ch4r10t33r · 2026-01-09T18:57:45Z

ansible/playbooks/clean-data.yml

+
+    - name: Extract all node names
+      shell: |
+        yq eval '.validators[].name' {{ validator_config_file }}


This is a nit, but can we please add a check to validate if yq is installed before using it here?
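One possible shape for that check, as a sketch. It assumes the check runs on the same host as the yq task, and the registered variable name yq_check is hypothetical.

```yaml
# Sketch only: probe for yq before the "Extract all node names" task runs.
- name: Check whether yq is available
  ansible.builtin.shell: command -v yq
  register: yq_check          # hypothetical variable name
  changed_when: false         # a lookup never changes the host
  failed_when: false          # let the next task report the problem clearly

- name: Fail early with a clear message when yq is missing
  ansible.builtin.fail:
    msg: "yq is required to read {{ validator_config_file }} but was not found in PATH"
  when: yq_check.rc != 0
```

Splitting the probe from the failure keeps the error message readable instead of surfacing a raw "command not found" from the shell task.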

KatyaRyazantseva · 2026-01-13T22:23:13Z

> We appear to be double cleaning. When clean_data=true is passed via site.yml, data gets cleaned:
>
> 1. First in clean-data.yml (step 1)
> 2. Then again in each role's tasks
>
> Should we consider using clean-data.yml for pre-deployment cleaning alone and remove role level cleaning when using site.yml?

This one was tricky. Both site.yml and deploy-nodes.yml work as independent playbooks and support the --cleanData flag, so both need cleaning. Correct me if I'm wrong.

When a server runs out of storage, cleaning must be the first remote task for all deployment playbooks. Otherwise, Ansible fails to create a temp folder on the server:

[ERROR]: Task failed: mkdir: cannot create directory /root/.ansible/tmp/ansible-tmp-1767979903.397822-19056-15485771047564: No space left on device

Currently, deploy-nodes.yml will fail with full storage. The common role runs first and tries to install packages. Then the individual client roles run tasks like "Extract node configuration" before they reach the cleaning tasks. All these tasks need temp space, so they fail before cleaning ever runs. I can add cleaning as the first task in deploy-nodes.yml (before the common role) and use a skip_role_cleaning flag when it's already done in site.yml. This avoids duplication, and we can delete cleaning on the role level.
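A rough sketch of what that first play in deploy-nodes.yml could look like, under several assumptions: the host pattern is a placeholder, data_dir and node_name are available as host variables, and skip_role_cleaning is the flag proposed above. It uses the raw module, which sends the command over SSH without copying a module to the remote temp directory, so it can still run when the disk is full.

```yaml
# Sketch only: a cleaning play placed before the common role.
- name: Clean node data before any other remote task
  hosts: all                  # placeholder; use the playbook's real host pattern
  gather_facts: false         # fact gathering would itself need remote temp space
  tasks:
    - name: Remove node data directory without using remote temp files
      ansible.builtin.raw: "rm -rf -- '{{ data_dir }}/{{ node_name }}'"
      when:
        - clean_data | default(false) | bool
        - not (skip_role_cleaning | default(false) | bool)
        # Guard against an empty variable expanding to a dangerous path.
        - data_dir | default('') | length > 0
        - node_name | default('') | length > 0
```

In this arrangement site.yml would pass skip_role_cleaning=true after running clean-data.yml, so the data is removed exactly once per run.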

KatyaRyazantseva · 2026-01-13T22:25:29Z

Should we move helper files like deploy-single-node.yml into a separate folder (e.g., playbooks/utils/ or playbooks/helpers/) to make it clear which playbooks are top-level and can be run independently?

KatyaRyazantseva added 2 commits January 7, 2026 00:39

fix: add clean-data playbook to prevent storage issues

f100371

Merge branch 'main' into clean-data-ansible

82fd7df

ch4r10t33r reviewed Jan 9, 2026


KatyaRyazantseva added 2 commits January 13, 2026 23:36

fix: use raw module for cleaning

8a0129f

Merge remote-tracking branch 'upstream/main' into clean-data-ansible

734e2a3
